Data Mining: Cause or Consequence

Author

  • Wagner Meira
Abstract

Data mining arose from the merger of several areas, such as databases, statistics, and artificial intelligence, and has grown steadily over the last 20 years. Recently, the popularization of Data Science and Big Data has accelerated this growth. In this seminar, we try to answer whether data mining is a cause or a consequence of these recent developments through an integrated view of four key components of data mining research and development, namely models, algorithms, systems, and applications, and how they are employed in scenarios such as the Internet and the Web. We will also discuss some trends related to knowledge and information discovery from massive data.

Similar resources

Cause-Consequence Modeling of Occupational Accidents in Construction Sites: A Retrospective Study in Iran

Introduction: Nearly half of occupational accidents in Iran occur in construction sites. Therefore, modeling of occupational accidents in these sites is one of the solutions to design safety strategies to reduce occupational accidents in the field of construction. This study was designed and conducted with the aim of modeling the cause-consequence of accidents in construction sites. Material a...

Improvement of Rule Generation Methods for Fuzzy Controller

This paper proposes fuzzy modeling using obtained data. A fuzzy system is known as a knowledge-based or rule-based system. The most important part of a fuzzy system is its rule base. One problem in generating fuzzy rules from training data is inconsistent data. The presence of inconsistent and uncertain states in the training data causes high modeling error. Here, Probability fuzzy system presents to...

Association rule mining application to diagnose smart power distribution system outage root cause

Smart grid has been introduced to address power distribution system challenges. In conventional power distribution systems, when a power outage happens, the maintenance team tries to find the outage cause and mitigate it. After this, some information is documented in a dataset called the outage dataset. If the team can estimate the outage cause before searching for it, the restoration time will...

Accuracy evaluation of different statistical and geostatistical censored data imputation approaches (Case study: Sari Gunay gold deposit)

Most of the geochemical datasets include missing data with different portions and this may cause a significant problem in geostatistical modeling or multivariate analysis of the data. Therefore, it is common to impute the missing data in most of geochemical studies. In this study, three approaches called half detection (HD), multiple imputation (MI), and the cosimulation based on Markov model 2...

Determination of optimal bandwidth in upscaling process of reservoir data using kernel function bandwidth

Upscaling based on the bandwidth of the kernel function is a flexible approach to upscaling data because cells are coarsened according to variability. The intensity of cell coarsening in this method can be controlled with the bandwidth. In a region of smooth variability, a large number of cells will be merged; conversely, cells remain fine where variability is severe. Bandwidth variatio...


Publication date: 2015